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FOR 

UNITED STATES LETTERS PATENT 

To the Assistant Commissioner of Patents: 

Be it known that we PETER B. DERVAN and ELDON J. BAIRD 
have invented certain new and useful improvements in 

"INHIBITION OF MAJOR GROOVE DNA BINDING PROTEINS BY MODIFIED 

POLYAMIDES" 



of which the following is a specification: 



15 



t 



Inhibition of Maior Groove DNA Binding Proteins Bv Modified P olvamides 

The U.S. Government has certain rights to this invention pursuant to Grant Nos. 
GM 26453, 27681, and 47530 awarded by the National Institute of Health. 
5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This applicatioois a cxratinuation-in-part of PCT/US97/03332 filed February 20, 
1997, Serial No. 08/853?522 filed May 8, 1997 and PCT/US97/12722 filed July 21, 1997 
which are continuation-in-part applications of Serial No. 08/837,524 filed April 21, 1997 
10 and Serial No. 08/607,078 filed February 26, 1996; provisional application 60/042,022, 
filed April 16, 1997; provisional application 60/043,444 filed April 8, 1997; 
PCT/US98/[MBHB 97,853] filed January 21, 1998; PCT/US98/[MBHB 97,854] filed 
January 29, 1998; and, PCT/US98/[MBHB98,016], filed January 29, 1998. The 
specifications of these applications are incorporated herein by reference. 



BACKGROUND OF THE I NVENTION 



Field of the Invention 

This invention relates to polyamides which bind to pre-determined sites of the 
20 minor groove of double-stranded DNA and have an a-amino acid domain ("positive 
patch") capable of inhibiting the activity of major groove DNA-binding proteins. 



Background of the Invention 

Polyamides containing tf-methylpyrrole (Py) and AT-methylimidazole (Im) amino 
25 acids bind to predetermined sequences in the minor groove of DNA with affinities and 

specificities comparable to naturally occurring DNA binding proteins (Trauger, et al. 

(1996) Nature 382, 559-561; Swalley, et al. (1997) J. Am. Chem. Soc. 119, 6953-6961; 

Turner, et al. (1997) J. Am. Chem. Soc. 119, 7636-7644). Sequence specificity is 

determined by a code of oriented side-by-side pairings of the Py and Im amino (Wade, et 
30 al. (1992) J. Am. Chem. Soc. 114, 8783-8794; Mrksich, et al. (1992) Proc. Natl. Acad. 

Sci. U.S.A. 89, 7586-7590; Wade, et al (1993) Biochemistry 32, 11385-11389; Mrksich, 
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et al. (1993) J. Am. Chem. Soc. 115, 2572-2576; White, et al. (1997) Chem. Biol 4, 569- 
578; White, et al. (1997) J. Am. Chem. Soc. 119, 8756-8765). An Im/Py pairing targets a 
G«C base pair, while Py/Im pair recognizes OG. The Py/Py pair is degenerate and 
targets both A«T and T*A base pairs (Pelton, et al. (1989) Proc. Natl. Acad. ScL USA 86, 
5 5723-5727; Chen, et al. (1994) Nature Struct Biol 1, 169-175; White, et al. (1996) 
Biochemistry 35, 12532-12537). The validity of the pairing rules for ligand design is 
supported by a variety of polyamide structural motifs which have been characterized by 
footprinting, affinity cleaving, 2-D NMR, and x-ray methods. The Py/Py pair is 
degenerate and targets both A»T and T«A base pairs. Polyamides have been found to be 
10 cell permeable and to inhibit transcription factor binding and expression of a designated 
gene (Gottesfeld, et al. (1997) Nature 387, 202-205; Nealy, et al. (1997) J. Mol Biol in 
press). Py/Im polyamides offer a potentially general approach for gene regulation, 
provided that efficient inhibition of DNA-binding can be achieved for a variety of 
transcription factors. 

15 Several approaches for the development of synthetic ligands which interfere with 

protein-DNA recognition have been reported. Oligodeoxyribonucleotides which 
recognize the major groove of double-helical DNA via triple-helix formation bind to a 
broad range of sequences with high affinity and specificity (Moser, et al. (1987) Science 
238, 645-650; Thuong, et al. (1993) Angew. Chem. Int. Ed. Engl 32, 666-690). Although 

20 oligonucleotides and their analogs have been shown to disrupt protein-DNA binding 
(Maher, et al. (1992) Biochemistry 31, 70-81; Duval-Valentin, et al. (1992) Proc. Natl 
Acad ScL U.S.A. 89, 504-508; Nielsen, P. E. (1997) Chem. Eur. J. 3, 505-508), the triple- 
helix approach is limited to purine tracts and suffers from poor cellular uptake. There are 
a few examples of carbohydrate-based ligands which interfere with protein-DNA 

25 recognition, but oligosaccharides cannot currently recognize a broad range of DNA 
sequences (Ho, et al. (1994) Proc. Natl. Acad. Set USA 91, 9203-9207; Liu, et al. (1996) 
Proc. Natl Acad. ScL USA 93, 940-944). Analogs of distamycin (PyPyPy) appended 
with multiple cationic substituents have been found to inhibit protein binding. Rational 
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design of tripyrrole peptides that complex with DNA by both selective minor-groove 

binding and electrostatic interaction with the phosphate backbone. (Bruice (1992) Proc. 

Natl Acad. Sci. USA 89, 1700-1704; Chiang, et al. (1997) Proc. Natl Acad. ScL USA 94, 

2811-2816; Bruice, et al. (1997) Bioorg. Med. Chem. 5, 685-692). Based on these 
5 encouraging results, we wished to identify similar charged residues which could be 

appended to a Py/Im polyamide via linear solid phase synthesis and would not 

compromise polyamide binding specificity. 

Proteins use a diverse structural library to recognize their target sequences (Steitz, 

T. A. (1990) Quart. Rev. Biophys. 23, 205-280). Proteins such as TBP bind exclusively 
10 in the minor groove (Kim, et al. (1993) Nature 365, 512-520), others, such as GCN4 
2 Oakley, M. G. & Dervan, P. B. (1990) Structural motif of the GCN4 DNA binding 

LJ domain characterized by affinity cleaving (Oakley, et al. (1990) Science 248, 847-850; 

J EUenberger, et al. (1992) Cell 71, 1223-1237; K6nig, et al. (1993) J. Mol. Biol. 233, 139- 

2 154), bind exclusively in the major groove, and certain proteins such as Hin recombinase 

P 15 recognize both grooves (Sluka, et al. (1990) Biochemistry 29, 6551-6561; Feng, et al. 

(1994) Science 263, 348-355). Polyamides have been found to interfere with protein- 

DNA recognition in cases where contacts in the minor groove are important for protein- 
Ty DNA binding affinity. For example, within the nine zinc-finger protein TFIIIA, fingers 4 

iQ and 6 bind in or across the minor groove and are required for high affinity binding (AT a = 

20 5 x 10 9 M" 1 ). An eight-ring hairpin polyamide - 3 x 10 10 M" 1 ) targeted to the 

minor groove contact region of finger 4 has been recently found to efficiently inhibit 

protein binding. 

X-Ray crystallography studies reveal that DNA bound by a 4-ring homodimeric 
polyamide is unaltered from its natural B-form structure, with all polyamide/DNA 
25 contacts confined to the minor groove (Kielkopf, et al. Nature Struct. Biol, in press). 
Polyamides have been shown to bind simultaneously with ligands that exclusively 
occupy the major groove (Oakley, et al. (1992). Biochemistry 31, 10969-10975; Park, et 
al. (1992) Proc. Natl Acad. ScL USA 89, 6653-6657). For example, an 8-ring hairpin 




polyamide and a recombinant protein containing only the three amino-terminal zinc 
fingers of TFTTTA which are in the major groove were found to co-occupy the TFIIIA 
binding site. Similarly, the three-ring homodimer ImPyPy bound simultaneously with the 
bZIP protein GCN4 (226-28 1 ). 
5 Intrinsic DNA curvature and protein induced DNA bending are also involved in 

the regulation of gene transcription, replication initiation, and other processes (Perez- 
Martin, J., et al. (1994) Microbiological Reviews 58, 268-290; Polaczek, et al. (1997), 
submitted). DNA is an inherently flexible polymer and neutral backbone analogs of 
DNA curve, where rigidity is maintained in natural DNA by coulombic repulsion 
10 between phosphates on the same strand Strauss, et al. (1994) Science 266, 1829-1834; 
Q Manning, G. S. (1983) Biopolymers 22, 689-729). Sequence-dependent curvature of 

y DNA is caused both by differential solvation in the minor groove and differential base 

p stacking leading to alteration of roll and tilt values (Dlakic, et a. (1996) J. Biological 

2 Chemistry 271, 17911-17919; Bolshoy, et al. (1991) Proa Natl Acad. ScL, USA 88, 

C 15 2312-2316). 

□ Proteins and other ligands that bend DNA alter the stacking of the bases by 

ill intercalation of hydrophobic groups, alter the effective Debye length of the surface 

Ijg through charge screening, or bend through energetic compensation for tight binding 

^ events. An example of a protein that seems to work through all three mechanisms in 

20 bending DNA >160 degrees is integration host factor (IHF) (Rice, et al. (1996) Cell 
87,1295-1306). Previously, it has been shown that artificial sequence specific DNA 
bending ligands can be designed that utilize bidentate tight binding third strand 
oligonucleotides to constrict the intervening duplex and bend DNA (Liberies, et al. 
(1996) Proc. Natl Acad. Scl, USA 93, 9510-4; Akiyama, et al. (1996) Proc. Natl 
25 Acad. ScL t USA 93, 1212212127; Akiyama, et al. (1996) J. Biological Chemistry 271, 
29126-29135; Akiyama, et al. (1997) Biochemistry 36, 2307-2315). 



Compounds that bind in the minor groove such as distamycin and DAPI have 
been shown to alter DNA rigidity (Larsson, et al. (1996) J. Physical Chemistry 100, 
3252-3263; McCarthy, et al. (1991) Nucleic Acids Research 19, 3421-9; Barcelo, et al. 
(1991) Biochemistry 30, 4863-73.). While such compounds form few specific contacts 
5 and binding is dominated by the positive charge, polyamide analogs of distamycin have 
been designed that form specific high affinity structures with DNA in the minor groove. 
Tn such compounds, sequence specificity is determined by the sequence of side-by-side 
amino acid pairings, where imidazole (Im) opposite pyrrole (Py) recognizes a GC base 
pair, Py-Im recognizes CG, Py-Py is degenerate for AT or TA, while Im-Im pairing is 
10 disfavored (Wade, et al. (1992) J. Am. Chem. Soc. 1 14, 87838794; Mrksich, et al. (1992) 
Q Proc. Natl Acad. ScL, USA 89,7586-7590; Wade, et al. (1993) Biochemistry 32, 

JJ 1 13851 1389; Pelton, et al. (1989) Proc. Natl. Acad. Sci., USA 86, 57235727; Pelton, et 

J al. (1990) J. Am. Chem. Soc. 112,1393-1399). This recognition motif generality has 

^ been demonstrated for a large number of sequences and is directly supported by NMR 

=P 15 data (Mrksich, et al. (1993) J. Am. Chem. Soc. 115, 2572-2576; Geierstanger, et al. 
3 (1994) Biochemistry 33, 3055-3062; Geierstanger, et al. (1993) J. Am. Chem. Soc. 

5 1 15,4474-4482; Geierstanger, et al. (1994) Science 266, 646-650; Mrksich, et al. (1995) 

!| J. Am. Chem. Soc. 1 17,3325-3332; Mrksich, et al. (1993) J. Am. Chem. Soc. 115, 9892- 

fl 9899; Dwyer, et al. (1993) J. Am. Chem. Soc. 115, 9900-9906; Mrksich, et al. (1994) J. 

20 Am. Chem. Soc. 116, 3663-3664; Mrksich, et al. (1994) J. Am. Chem. Soc. 116, 
79837988; Chen, et al. (1994) J. Am. Chem. Soc. 116, 6995-7005; Cho, et al. (1995) 
Proc. Natl. Acad. Sci., USA 92, 10389-10392). 

BRIEF DESCRIPTION OF THE FIGURES 

25 FIGURE 1 . Schematic models of polyamides targeted to GCN4 binding site. 
FIGURE 2. Schematic models of Arg-Pro-Arg polyamides. 
FIGURE 3. Structure of DNA binding domain of Hin recombinase. 
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FIGURE 4. 
FIGURE 5. 
FIGURE 6. 
FIGURE 7. 
5 FIGURE 8. 
FIGURE 9. 
FIGURE 10. 
FIGURE 11. 
FIGURE 12. 
10 FIGURE 13. 
FIGURE 14. 

O FIGURE 15. 

■XI 

^ 15 FIGURE 16. 

FIGURE 17. 



FIGURE 18. 
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Eight-ring hairpin polyamides. 

Synthesis of Arg-Pro-Arg polyamides. 

GCN4 gel mobility shift experiments. 

Arg-Pro-Arg- Arg-Arg-Arg polyamides. 

GCN4 gel mobility shift experiments. 

Aliphatic amino acid substitutions in positive patch domain. 

Structure of ImPyPyPy-y-PyPyPyPy-C7-RPR. 

Quantitative Dnase footprint experiments. 

Structural requirements for GCN4 binding. 

Binding domains and chemical structures of PA1 and PA2. 

Gel shift analysis of oligonucleotides 2-6, 9 on a 3' 32 P end-labeled 

restriction fragment. 

Gel shift analysis of oligonucleotides 2 (A) and 9 (B) on a 3' 32 P end- 
labeled restriction fragment. 

Gel shift analysis of oligonucleotide 2 on a 3' 32 P end-labeled restriction 
fragment generated with EcoRl and HindUl. 

Gel shift analysis of oligonucleotide 2 on a 3' 32 P end-labeled restriction 
fragment generated with EcoRl and HindHl. 

MPE and DNase I footprinting analysis of oligonucleotide2 on a 3' 32 P 
end-labeled restriction fragment generated with EcoRl and PvuU. 



SUMMARY OF THE INVENTION 

This invention provides improved polyamides comprising a positive patch that, 
upon binding of the polyamide to the minor groove of a DNA molecule, is able to contact 

25 nucleotides in the major groove of a DNA molecule. The positive patch may comprise 
any chemical moiety that delivers a charge to the DNA molecule. The invention further 
comprises polyamides comprising having the ability to alter the conformation of a DNA 
molecule such that the function of a conformation-dependent DNA binding protein is 
inhibited. As such, the polyamides inhibit gene expression by binding the minor groove 

30 DNA sequence and displacing or preventing the binding or function of DNA-binding 
proteins such as transcription factors. 
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DETAILED DESCRIPTION 

Within this application, unless otherwise stated, definitions of the terms and 
illustration of the techniques of this application may be found in any of several well- 
5 known references such as: Sambrook, J., et al., Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press (1989); Goeddel, D., ed., Gene Expression 
Technology, Methods in Enzymology, Vol. 185, Academic Press, San Diego, CA (1991); 
"Guide to Protein Purification" in Deutshcer, M.P., ed., Methods in Enzymology, 
Academic Press, San Diego, CA (1989); Innis, et al., PCR Protocols: A Guide to Methods 
10 and Applications, Academic Press, San Diego, CA (1990); Freshney, R.I., Culture of 
Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan Liss, Inc. New York, NY 
(1987); Murray, E.J., ed., Gene Transfer and Expression Protocols, pp. 109-128, The 
3 Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University Press, New 

Q York (1997). 

^ 15 For the purposes of this application, a promoter is a regulatory sequence of DNA 

Sj that is involved in the binding of RNA polymerase to initiate transcription of a gene. A 

~ gene is a segment of DNA involved in producing a peptide, polypeptide or protein, 

=^ including the coding region, non-coding regions preceding ("leader") and following 

iS ("trailer**) the coding region, as well as intervening non-coding sequences ("introns") 

^ 20 between individual coding segments ("exons"). Coding refers to the representation of 
amino acids, start and stop signals in a three base "triplet" code. Promoters are often 
y upstream ("5' to") the transcription initiation site of the corresponding gene. Other 

regulatory sequences of DNA in addition to promoters are known, including sequences 
involved with the binding of transcription factors, including response elements that are 
25 the DNA sequences bound by inducible factors. Enhancers comprise yet another group of 
regulatory sequences of DNA that can increase the utilization of promoters, and can 
function in either orientation (5'-3* or 3'-5') and in any location (upstream or 
downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the 
30 regulatory sequence increases transcription, thereby resulting in increased expression of 
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the corresponding target <>ene. In such a case, interference with transcription by binding a 
polyamide to a regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in 
the art as a silencer. A silencer sequence generally has a negative regulatory effect on 
5 expression of the gene. In such a case, expression of a gene may be increased directly by 
using a polyamide to prevent binding of a factor to a silencer regulatory sequence or 
indirectly, by using a polyamide to block transcription of a factor to a silencer regulatory 
sequence. 

It is to be understood that the polyamides of this invention bind to double stranded 
10 DNA in a sequence specific manner. The function of a segment of DNA of a given 

sequence, such as 5'-TATAAA-3\ depends on its position relative to other functional 

regions in the DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the sense 
□ strand of DNA is positioned about 30 base pairs upstream of the transcription start site, 

\7\ the sequence forms part of the promoter region (Lewin, Genes VI, pp. 831-835). On the 

^ 1 5 other hand, if the sequence 5 '-TATAAA-3 * is downstream of the transcription start site in 
\A a coding region and in proper register with the reading frame, the sequence encodes the 

tyrosyl and lysyl amino acid residues (Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the 
% polyamides of this invention modulate gene expression by altering the binding of DNA 

M 20 binding proteins, such as RNA polymerase, transcription factors, TBF, ThlllB and other 
[q proteins. The effect on gene expression of polyamide binding to a segment of double 

0 stranded DNA is believed to be related to the function, e.g., promoter, of that segment of 

DNA. 

It is to be understood by one skilled in the art that the improved polyamides of 
25 the present invention may bind to any of the above-described DNA sequences or any 
other sequence having a desired effect upon expression of a gene. In addition, U.S. Patent 
No. 5,578,444 describes numerous promoter targeting sequences from which base pair 
sequences for targeting an improved polyamide of the present invention may be 
identified. 

30 It is generally understood by those skilled in the art that the basic structure of 

DNA in a living cell includes both major and a minor groove. For the purposes of 



describing the present invention, the minor groove is the narrow groove of DNA and the 
major groove is the deep groove of DNA as illustrated in common molecular biology 
references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

It is further understood by those skilled in the art that a DNA binding protein is a 
5 protein capable of making contact with a DNA molecule, generally in the major groove, 
through hydrogen bonds, ionic bonds and/or hydrophobic interactions as illlustrated in 
common molecular biology references such as Lewin, supra, or Alberts, et al., Eds., 
Molecular Biology of the Cell, 3 rd , Ed, Garland Publishing, Inc., New York, 1994. 
Preferably, a DNA-binding protein is one that affects gene expression following binding 
10 to a DNA molecule. 

Several basic motifs of DNA-binding proteins are known to those skilled in the 
art. One such conformation is the helix-turn-helix motif, which includes a specific 
subclass known as homeodomain proteins. Other common motifs include the zinc finger 
motif, the leucine zipper motif, and the helix-loop-helix motif. A DNA-binding protein 
15 of this application may maintain any of the above-described motifs or any other motif 
that provides a protein with the ability to bind DNA and affect gene expression. 

To affect gene expression in a cell, which may include causing an increase or a 
decrease in gene expression, a effective quantity of one or more polyamide is contacted 
with the cell and internalized by the cell. The cell may be contacted in vivo or in vitro. 
20 Effective extracellular concentrations of polyamides that can modulate gene expression 
range from about 10 nanomolar to about 1 micromolar. Gottesfeld, J.M., et al, Nature 
fl 387 202-205 (1997). To determine effective amounts and concentrations of polyamides in 

vitro, a suitable number of cells is plated on tissue culture plates and various quantities of 
one or more polyamide are added to separate wells. Gene expression following exposure 
25 to a polyamide can be monitored in the cells or medium by detecting the amount of the 
protein gene product present as determined by various techniques utilizing specific 
antibodies, including ELISA and western blot. Alternatively, gene expression following 
exposure to a polyamide can be monitored by detecting the amount of messenger RNA 
present as determined by various techniques, including northern blot and RT-PCR. 
30 Similarly, to determine effective amounts and concentrations of polyamides for 

in vivo administration, a sample of body tissue or fluid, such as plasma, blood, urine, 
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cerebrospinal fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate 
tissue source is analyzed. Gene expression following exposure to a polyamide can be 
monitored by detecting the amount of the protein gene product present as determined by 
various techniques utilizing specific antibodies, including ELISA and western blot. 
5 Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of messenger RNA present as determined by various techniques, 
including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and 
therapeutic compositions for in vivo or in vitro use. Representative methods of 
10 formulation may be found in Remington: The Science and Practice of Pharmacy, 19th 
ed., Mack Publishing Co., Easton, PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically 
Q acceptable pharmaceutical composition that is administered to a patient in need of 

'ff, treatment or an animal for medical or research purposes. The polyamide composition 

Si 15 comprises pharmaceutical^ acceptable carriers, excipients, adjuvants, stabilizers, and 
>u vehicles. The composition may be in solid, liquid, gel, or aerosol form. The polyamide 

□ composition of the present invention may be administered in various dosage forms orally, 

!~ parentally, by inhalation spray, rectally, or topically. The term parenteral as used herein 

includes, subcutaneous, intravenous, intramuscular, intrasternal, infusion techniques or 
^ 20 intraperitoneal^. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
compound, the intended use, the nature and severity of the condition being treated or 
diagnosed, the age, weight, gender, physical condition and mental acuity of the intended 
25 recipient as well as the route of administration. Such considerations are within the 
purview of the skilled artisan. Thus, the dosage regimen may vary widely, but can be 
determined routinely using standard methods. 

Polyamides of the present invention are also useful for detecting the presence of 
double stranded DNA of a specific sequence for diagnostic or preparative purposes. The 
30 sample containing the double stranded DNA can be contacted by polyamide linked to a 
solid substrate, thereby isolating DNA comprising a desired sequence. Alternatively, 




polyamides linked to a suitable detectable marker, such as biotin, a hapten, a radioisotope 
or a dye molecule, can be contacted by a sample containing double stranded DNA. 

The design of Afunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
5 specifically bind with subnanomolar affinity to the minor groove of a predetermined 
sequence of double stranded DNA are linked to a functional molecule, providing the 
corresponding Afunctional conjugates useful in molecular biology, genomic sequencing, 
and human medicine. Polyamides of this invention can be conjugated to a variety of 
functional molecules, which can be independently chosen from but is not limited to 
10 arylboronic acids, biotins, polyhistidines comprised from about 2 to 8 amino acids, 
haptens to which an antibody binds, solid phase supports, oligodeoxynucleotides, N- 
ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic acid, 

O acridine, captothesin, pyrene, mitomycin, texas red, anthracene, anthrinilic acid, avidin, 

in 

m DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 4-(psoraen-8-yloxy)-butyrate, 

^ 15 tartaric acid, (+)-a-tocopheral, EDTA, methidium, acridine, Ni(H)*Gly-Gly-His, thiazole 
SJ orange (TO), Dansyl, pyrene, N-bromoacetamide, and gold particles. Such Afunctional 

^ polyamides are useful for DNA affinity capture, covalent DNA modification, oxidative 

^ DNA cleavage, DNA photocleavage. Such bifunctional polyamides are useful for DNA 

|S detection by providing a polyamide linked to a detectable label. Detailed instructions for 

20 synthesis of such bifunctional polyamides can be found in copending U.S. provisional 

ill 

Ifi application 60/043,444, the teachings of which are incorporated by reference. 

s DNA complexed to a labeled polyamide can then be determined using the 

appropriate detection system as is well known to one skilled in the art. For example, 
DNA associated with a polyamide linked to biotin can be detected by a streptavidin / 
25 alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, 
for assaying for the presence of the double stranded DNA sequence bound by the 
polyamide of this invention in a body sample, such brain tissue, cell suspensions or tissue 
sections, or body fluid samples such as CSF, blood, plasma or serum, where it is desirable 
30 to detect the presence, and preferably the amount, of the double stranded DNA sequence 
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bound by the polyamide in the sample according to the diagnostic methods described 
herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
5 packaged reagent(s) are also typically included. As used herein, the term "package" 
refers to a solid matrix or material such as glass, plastic (e.g., polyethylene, 
polypropylene or polycarbonate), paper, foil and the like capable of holding within fixed 
limits a polyamide of the present invention. Thus, for example, a package can be a glass 
vial used to contain milligram quantities of a contemplated polyamide or it can be a 
10 microtiter plate well to which microgram quantities of a contemplated polyamide have 
been operatively affixed, i.e., linked so as to be capable of being bound by the target 
DNA sequence. "Instructions for use" typically include a tangible expression describing 
the reagent concentration or at least one assay method parameter such as the relative 
\j\ amounts of reagent and sample to be admixed, maintenance time periods for reagent or 

11 15 sample admixtures, temperature, buffer conditions and the like. A diagnostic system of 
M the present invention preferably also includes a detectable label and a detecting or 

5 indicating means capable of signaling the binding of the contemplated polyamide of the 

* present invention to the target DNA sequence. As noted above, numerous detectable 

m labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or 

r; 20 indirect) streptavidin, are well known in the art. 

E Trauger,et al. (Nature, 382: 559-561) and Swalley, et al. (J. Am. Chem. Soc. 119: 

6953-6961) have described recognition of DNA by certain polyamides at subnanomolar 
concentrations. Pairing specific carboxyamide groups allows for recognition of specific 
DNA sequences (Swalley, et al. supra). Polyamides comprising Hp, Im, and Py provide 
25 for coded targeting of pre-determined DNA sequences with high affinity and specificity. 
Hp, Im, and Py polyamides may be combined to form Im/Py, Py/Im, Hp/Py, and Py/Hp 
binding pairs which complement the four Watson-Crick base pairs A, C, G, and T. Table 
1 illustrates such pairings. 
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TABLE 1 



Pairing Codes for Base Pair Recognition* 



Pair 


OC 


OG 


T'A 


A»T 


Im/Py 


+ 








Py/Im 




+ 






Hp/Py 






+ 




Py/Hp 








+ 



♦favored (+), disfavored (-) 



5 Three-, four-, five- or six-ring improved polyamides of the present invention are 

covalently coupled to form six-, eight-, ten- or twelve-ring strucures, respectively, that 
bind specifically to four or six base pair targets, respectively, at subnanomolar 
concentrations. As such, the improved polyamides of the present invention may be 
J3 directed to any DNA sequence comprised of A, C, G, or T. 

Li 10 The instant invention provides polyamides having the ability to interfere with 

P gene expression by altering the topology of a DNA molecule physically or by altering the 

p chemical environment of the DNA molecule. By altering the topology of a DNA 

^ molecule, it is possible to inhibit the function of DNA-binding proteins that are 

□ dependent on DNA conformation for binding. The inclusion of a positive patch in a 

:j: 15 polyamide allows for alteration of the chemical environment surrounding the DNA 
! U molecule and serves to inhibit binding or function of DNA-binding proteins that bind the 

; U major groove of a DNA molecule. 

Transcription and replication of DNA is dependent upon intrinsic DNA curvature 
and protein-induced DNA bending. Previously, sequence-specific DNA bending ligands 
20 have been designed to bind two noncontiguous target sites in the major groove and 
induce a bend in the DNA (Liberies, D. A. & Dervan, P. B. (1996) Proc. Natl Acad. 
ScL, USA 93, 9510-4). This bend was shown to be dependent upon the linker length 
connecting the two sites. The present invention comprises sequence-specific polyamides 
targeted to the minor groove of the double helical linker region not overlapping either 
25 triple helical region, are capable of inhibiting bidentate third strand oligonucleotide 
binding. This inhibition through rigidification of the duplex is dependent upon the bend 




angle of the DNA, but is independent of the order of addition of bending and 

straightening ligands. As such, polyamides may be useful for displacing DNA bending 

transcription factors in cells. 

It has been demonstrated that artificial sequence-specific DNA bending ligands 
5 can be designed that utilize bidentate tight binding third strand oligonucleotides to 

constrict the intervening duplex and bend DNA (Liberies, et al. (1996) Proc. Natl. Acad. 

Set, USA 93, 9510-4; Akiyama, et al. (1996) Proc. Natl Acad. ScL, USA 93, 12122- 

12127; Akiyama, et al. (1996) J. Biological Chemistry 271, 29126-29135; Akiyama, et al. 

(1997) Biochemistry 36, 2307-2315). Presented herein is a mechanism of action by which 
10 certain improved polyamides affect DNA conformation, thus inhibiting the function of 

DNA-binding proteins that are dependent upon the conformation of DNA for binding. 

While compounds such as DAPI form a few specific contacts and binding is dominated 
w by the positive charge of the compound, polyamide analogs of distamycin have been 

u\ designed that form specific high affinity structures with DNA in the minor groove. 

^ 15 Provided herein are polyamides having the ability to displace a DNA bending by 
sj rigidification of a bent region not contacted by the ligand. A polyamide of the present 

^ invention may be utilized to "straighten", defined herein as altering the conformation of a 



designed to disrupt the unique microenvironment representing a protein binding site on 
the DNA double helix. The present invention comprises improved polyamides having a 
positive patch. Preferably, the positive patch contacts the phosphate backbone and 
disrupts the microenvironment, thereby preventing the binding and/or function of proteins 

25 that bind the DNA major groove. Polyamides that deliver a positive patch to the DNA 
backbone most likely destabilize the contacts between the protein side chains and the 
phosphate residues, and thereby inhibit protein binding (Bruice, et al. (1992). Proc^Natl 
Acad. ScL USA 89, 1700-1704; Chiang, et al. (1997). Proc. Natl. Acad. ScL USA 94, 
281 1-2816; Bruice, et al. (1997). Bioorg. Med. Chem. 5, 685-692) (Figure 1). 

30 The positive patch comprises any chemical moiety that is capable of delivering a 

charge to the chemical environment of the DNA molecule. The positive patch comprises 
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DNA molecule such that a confonnationally-dependent DNA-binding protein is unable to 
bind to the DNA or function properly, DNA molecules of widely varying sequence. 

The negatively charged DNA phosphate backbone provides a target for ligands 
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a charged group placed on the C-terminus, N-terminus, N-methyl group or other 
modifiable position of the polyamide. Preferably, the charged group comprises any 
amino acid having a net charge of at least +1. More preferably, the charged group is a 
primary, secondary, tertiary, quartemary amino group or a guanidinium or amidinium 
5 group. In one embodiment, the positive patch comprises an amino acid residue having a 
net positive charge joined to the C-terminus of a polyamide such that the polyamide has 
the ability to displace or prevent the binding of a DNA-binding protein to the major 
groove of a DNA molecule (Figure 12). 

It has been demonstrated that a variety of polyamide motifs co-occupy the DNA 
10 helix at sites both overlapping and adjacent to certain DNA-binding proteins, such as 
GCN4 (Figure 1). Table 2 shows schematic models of polyamides targeted to the binding 
site of the bZIP transcriptional activator, GCN4: ImPyPy-Dp, ImPyPy-y-ImPyPy-p-Dp, 
% ImPyPyPy-y-PyPyPyPy-p-Dp, and ImPyPy-y-ImPyPy-p-PyPyPy-G-Dp. The nine-base 

W pair (S'-CTGACTAAT-S') GCN4 binding site is indicated by brackets above and below 

£ 15 the base pairs. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) 
^ polyamide rings, respectively. Diamonds and triangles represent (i-alanine (p) and 

£ glycine (G), respectively. y-Aminobutyric acid (y) and dimethylaminopropylamide (Dp) 

m are depicted as a curved line and a plus sign, respectively. Polyamide binding sites are 

S shown in bold. Equilibrium association constants (ATa) for each polyamide binding to the 

ij 20 indicated match site are shown al the right. Association constants were determined by 
0 DNase I footprinting; simultaneous binding was determined by gel mobility shift assay. 



Table 2 

Polyamide Motifs That Bind Simultaneously with Major Groove DNA-Binding 

Proteins 



Polyamide Complex M M "') 

5'-T G ClC f 6 A C f A A TlA G T -3* 

ImPyPy-Dp ^^O*^ 1 X 105 



5'-T G ClC T 0 A C f A A TlA G T -3' 
ImPyPy^lmPyPy-p-Dp (SoS^ 1 * 10? 

3'-A C GiGACTOATT AiT C A -5' 



5'-GlC T C A C T A A TIT O I T A I C-3' 
lmPyPyPy-7-PyPyPyPy-^-Dp +XOOOO^ 1 x 10 9 

V-C.G A C T G A T T AiA C A A T A G-5 ' 



5'-T G CfrfTRTTTflA G T -3' 
ImPyPy^lmPyPy-p-PyPyPy-G-Dp (^§2 <>00 ^ 1 X 10 10 

3'-A C G.C A C T P A T T A.T C A -5' 



;L, 5 

M The improved polyamides of the present invention have at least three consecutive 

!jg carboxamide pairings for binding DNA in the minor groove of a regulatory sequence of a 

B duplex gene sequence, and a positive patch sequence for interference with DNA-binding 

10 protein function. The improved polyamides may further comprise a chiral hairpin turn 
having a stereochemical center substituted at the y-position of the chiral hairpin turn of 
the molecule with the R-enantiomer of 2,4-diaminobutyric acid (H 2 NHCHCH 2 CHNH 2 - 

COOH;"(/?) H2N Y"). 

The present invention comprises improved polyamides having three or four-ring 
15 polyamide structures covalently coupled to form six-, eight-, ten- or twelve-ring hairpin 
structures, respectively, of the general structures I- VIII: 
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X.XjXjyXAX.A X 1 X 2 X 3 X 4 yX 5 X 6 X 7 X 8 A X 1 X 2 X 3 X 4 X 5 yX 6 X 7 X 8 X 9 X 10 A 
I II HI 

Xj X 2 X 3 X 4 X5X 6 yX 7 X 8 X* > X j qX j j X j 2 A 
IV 

XjX 2 X 3 X t jX ! 2 X 1 ^ X|X 2 X3X 4s ^ X'iX^X^X^^ 

AX^xf N UC 6 X , 5 X , 4 / N p AX 8 X 7 X 6 xf X LX , 8 X 7 X , 6 X , 5 p 

V VI 



X t X 2 X 3 X 4 X 5 X ! j X f 2 X 3X4X5 ^ 

Y Y 
AX^XgX^ N LX , ,oX , ^ 7 X , . / \ 

vn 



N X 1 X 2 X3X 4 X 5 X 6 X'jX^X^X^X'jX'j 

w AXi 2 X n X 10 X9XgX7 LXi 2 X n X 1 oX 9 X 8 X , 7 p 

■JE 

vin 

i where X M2 and X , ,., 2 are independently an imidazole such as N-methylimidazole (Im), a 

jrj pyrrole such as N-methylpyrrole (Py), or a hydroxypyirole such as 3-hydroxy-N-methyl 

l D 10 pyrrole (Hp). In addition, an improved polyamide of the present invention may further 
** include a aliphatic amino acid such as p-alanine residue (p), an amide group such as 

dimethylaminopropylamide (Dp), an alcohol such as EtOH, an acid such as EDTA, or 
any derivative thereof may be joined to the P residue. 

"A" represents a positive patch which comprises any chemical moiety that is 
15 capable of delivering a positive charge to the major groove of a DNA molecule. 
Preferably, the positive patch comprises a rigid group joined to a positively charged 
group. The rigid group positions the positively charged group such that contact with the 
major groove or the phosphate backbone of a DNA molecule is accomplished. 

Preferably, the rigid group comprises one to ten amino acids. More preferably, 
20 the rigid group comprises one to eight amino acids. More preferably, the rigid group 
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comprises one to six amino acids. More preferably, the rigid group comprises one to four 
amino acids. Most preferably, the rigid group comprises two amino acids. Of the most 
preferred rigid group, the first amino acid is positioned adjacent to the poly amide and 
may comprise arginine, proline, lysine, hydroxyproline, the corresponding L or D 
5 enantiomers thereof or a derivative thereof. Preferably, the first amino acid is arginine or 
lysine and most preferably the first amino acid is arginine. The second amino acid is 
positioned at the caiboxy end of the first amino acid. Suitable second amino acids 
comprise proline, glycine, serine, threonine, leucine, isoleucine, valine, alanine, 
hydroxyproline the corresponding L or D enantiomers thereof or a derivative thereof. 
10 Preferably, the second amino acid is proline or glycine and most preferably the second 
amino acid is proline. 

A suitable positively charged group comprises a synthetic or naturally occurring 
amino acid. Preferably, the positively charged group is a primary amino group, 
secondary amino group, tertiary amino group, quartenary amino group, guanidinium 
15 group or amidinium group. It is preferred that the positively charged group is an amino 
acid bearing a net charge of at least +1 . More preferably, the positively charged group is 
arginine, lysine, histidine or a derivative thereof. Most preferably, the positively charged 
group is arginine. 

m The positive patch may be joined to the polyamide using an "attachment" group. 

!r! 20 Preferably, the attachment group comprises an amino acid. More preferably, the 

cu 

attachement moiety is p-alanine, y-aminobutyric acid, valeric acid, or any of the 

; S ' 

corresponding 2-amino derivatives of p-alanine, y-aminobutyric acid, or valeric acid. 

Most preferably, the attachment group is P-alanine. 

In addition, an improved polyamide of the present invention may further include a 

25 aliphatic amino acid such as p-alanine residue (P), an amide group such as 

dimethylaminopropylamide (Dp), an alcohol such as EtOH, an acid such as EDTA, or 

any derivative thereof may be joined to the p residue. 

P-alanine may also be utilized in place of a pyrrole or hydroxypyrrole amino acid 

in Formulas I-VHL The use of P-alanine in place of a pyrrole or hydroxypyrrole amino 

30 acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and 



p/Py) and aliphatic/aliphatic pairing (p/p) substitution. Such substitutions may comprise 
those described in provisional application 60/042,022, incorporated herein by reference. 
The use of y-aminobutyric acid, or a substituted y-aminobutyric acid such as (R)-2,4 
diaminobutyric acid, provides for preferred hairpin turns. Inclusion of the positive patch 
5 allows the polyamide to deliver a positive residue to the DNA backbone and interfere 
with protein-phosphate contacts. As such, the positive patch contacts the major groove or 
the phosphate backbone of a DNA molecule and inhibits the binding or function of DNA- 
binding proteins. Many other groups suitable for the purposes of practicing this invention 
are well known and widely available to one skilled in the art. 
1 0 The polyamide subunit structures I-VTH above may be covalently coupled through 

the y residue which represents a -NH-CH 2 -CT 2 -CH 2 -CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric acid, 
jj The present invention provides the reagents and methodologies for substituting the y- 
rj residue of certain polyamides with a moiety such as (/?)-2,4,-diaminobutyric acid 

=F 15 ((/i) H2N y). the NMR structure of a hairpin polyamide of sequence composition 

□ ImPyPy-y-PyPyPy complexed with a 5'-TGTTA-3* target site indicated that it was 
^ possible to substitute the a-position of the y-aminobutyric acid residue within the hairpin- 

□ DNA complex (de Claire, et al. J. Am. Chem. Soc. 1997, 779, 7909). Modeling indicated 
\1 that replacing the a-H of y with an amino group that may confer an ^-configuration at the 
• z 20 a-carbon could be accommodated within the floor and walls of the minor groove as 

demonstrated in Figure 1 and 2A. In contrast, the (S>2,4,-diaminobutyric acid ((5) y ) 
linked hairpin is predicted to clash with the walls of the minor groove of the DNA helix 
as illustrated in Figures 1 and 2B. 

In Formulas V-VTII, L represents an amino acid linking group such as p-alanine 
25 or 5-aminovaleric acid (8) bound to the y residue of a first polyamide and to the 
carboxytail of a second polyamide. As such, two or more polyamides may be linked, 
forming a tandemly-linked polyamide. Such a polyamide is said to be tandemly-linked or 
a tandem-linked polyamide. 
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P represents from zero to ten polyamides of formulas IX-XII: 

X,X 2 X3yX 4 X5X < X^XjX^XjX^Xg X 1 X2X3X4X5YX 6 X 7 XgX 9 X 10 
IX X XI 

X|X2X 3 X4X5X 6 yX7X 8 X 9 X 10 Xi iX| 2 

xn 

that may be tandemly linked to another polyamide. Preferably, P represents from zero to 
eight polyamides of formulas DC-XII. More preferably, P represents from zero to six 
polyamides of formulas IX-XII. More preferably, P represents from zero to four 
polyamides of formulas"IX-XII. ^Niost preferably, P represents from zero to two 
polyamides of formulas IX-XII. In Formulas IX-XII, X,-X 12 are as defined above. 
Tandem linking of polyamides provides expanded binding site size and increased binding 
affinity without compromising selectivity. Many other groups suitable for the purposes 
of practicing this invention are well known and widely available to one skilled in the art. 

Baird, et al. (7. Am. Chem. Soc. 118: 6141-6146) and PCT/US97/003332 describe 
methods for synthesis of polyamides which are suitable for preparing polyamides of this 
invention. Polyamides of the present invention may be synthesized by solid phase 
methods using compounds such as Boc-protected 3-methoxypyrrole, imidazole, and 
pyrrole aromatic amino acids, which are cleaved from the support by aminolysis, 
deprotected with sodium thiophenoxide, and purified by reverse-phase HPLC. The 
identity and purity of the polyamides may be verified using any of a variety of analytical 
techniques available to one skilled in the art such as 1H-NMR, analytical HPLC, and/or 
matrix-assisted laser-desorption ionization time-of-flight mass spectrometry (MALDI- 
TOF MS-monoisotropic). 

Described herein is the synthesis of a new class of polyamides and their 
characterization with regard to DNA binding affinity and sequence specificity. Revealed 
herein to one skilled in the art are properties of positive patch elements that may be 
utilized as a guide in the design of more efficient polyamides. The present invention 
provides one skilled in the art with the reagents and methodologies for the design, 
synthesis and utilization of polyamides comprising a positive patch. As a representative 
example, a series of polyamides with Arg-Pro-Arg tripepetides at the C-terminus are 
provided. It is demonstrated that such polyamides selectively inhibit DNA binding by the 




major groove-binding transcription factor GCN4, as measured by gel mobility shift 
assays. Polyamides having certain residues of the positive patch substituted are provided 
to identify the function of each amino acid in inhibition of protein binding. Also 
provided are DNase I footprint titration experiments to measure the effect of net ligand 
5 charge on both the DNA binding affinity and specificity of the hairpin polyamide. As 
demonstrated herein, addition of an Arg-Pro-Arg tripeptide does not result in significant 
alteration of polyamide-DNA binding affinity or specificity. 

The instant invention provides polyamides that act as synthetic ligands to affect 
binding of proteins with affinity for the major groove of DNA, as well as methods of 
10 making and using such polyamides. In addition, sequences adjacent to or neighboring the 
protein binding site may be targeted, as exemplified by selective inhibition of GCN4 at 
two sites using Arg-Pro-Arg-polyamides. The Arg-Pro-Arg domain appears to adopt a 
□ stable and defined structure which delivers a neutralizing positive charge to the DNA 

Tfi backbone where it competes with GCN4 for contact to the phosphates. The broad 

^ 15 targetable sequence repertoire of polyamides, coupled with the ubiquity of backbone 
\\ contacts in protein recognition of DNA, make phosphate neutralization by a positive 

=2 patch a promising approach for inhibition of major groove transcription factors. 

One skilled in the art may utilize the examples provided herein to design 
^ polyamides comprising a positive patch sequence. The examples listed above and those 

M 20 illustrated below represent only certain embodiments of the present invention and are not 
[ limiting of the specification and claims in any way. 

EXAMPLES 
Example 1 

25 Materials and Methods 

All buffers for gel mobility shift and footprinting experiments were prepared from 
J.T. Baker reagents and 0.2 ^iM filtered. EDTA and DTT were obtained from Gibco 
BRL. Poly(dI-dC)*poly(dI-dC) was from Pharmacia Biotech. Ficoll (MW 400,000) was 
purchased from Sigma. T4 polynucleotide kinase, £coRI, PvwII, and DNase I were from 




Boehringer Mannheim. AflU and Fspl were purchased from New England Biolabs. 

32 

Sequenase (version 2.0) was obtained from United States Biochemical, [a- P]- 
Thymidine-5-triphosphate (> 3000 Ci/mmol), [a- P]-deoxyadenosine-5'-triphosphate 
(>6000 Ci/mmol), and were purchased from Du Pont/NEN. [y- 32 P]-Adenosine-5'- 
5 triphosphate (> 7000 Ci/mmol) was obtained from ICN. GCN4 (222-281) was prepared 
by solid phase synthesis. MBHA resin (0.57 mmol/g) was from Calbiochem. Boc-p-Ala, 
Boc-Y<mtinobutyric,^ and 
Boc-D-Pro were from Peptides International. p-Cresol was purchased from Aldrich. All 
other chemicals, as well as the purification and characterization of polyamides were as 
10 previously described (Baird, et al. (1996) 7. Am. Chem. Soc. 118, 6141-6146). 

^ Example 2 

\jj Arg-Pro-Arg Polyamides 

r Schematic models of certain polyamides targeted to the binding site of the bZIP 

^ 15 transcriptional activator GCN4 are illustrated in Figure 1, parts a-d. A series of 
£ polyamides with Arg-Pro-Arg tripeptides at the C-terminus were synthesized by solid 

L, phase methods. The polyamides were evaluated as inhibitors of the major groove 

31 transcription factor GCN4, the prototypical member of the basic region-leucine zipper 

(bZIP) family of transcriptional regulators (Hurst, H. C. (1995) Protein Profile 2, 105- 
fl 20 168; Struhl, K. (1992) Yeast GCN4 transcriptional activator protein. In Transcriptional 
'** Regulation. (McKnight, S. L. & Yamamoto, K. R., eds), pp. 833-859, Cold Spring 

Harbor Laboratory Press, New York; Curran, T.& Vogt, P. (1992) Dangerous liasions: 
fos and jun, oncogenic transcription factors. In Transcriptional Regulation. (McKnight, 
S. L. & Yamamoto, K. R., eds), pp. 797-832, Cold Spring Harbor Laboratory Press, New 
25 York). The C-terminal sixty amino acids (222-281) of GCN4 contain the "leucine 
zipper" dimerization domain and the "basic region" which is responsible for DNA 
binding. GCN4 (222-281) has been shown to be sufficient for sequence specific binding 
(Oakley, et al. (1990) Science 248, 847-850; Hope, et al. (1986) Cell 46, 885-894; 
Paolella et al. (1994) Science 264, 1130-1133). The basic region of each a-helical 
30 monomer makes specific hydrogen bonds, van der Waals contacts, and phosphate 




interactions with c ie half-site of the nine base pair pseudosymmetrical GCN4 binding 

site (Figure 2 A & B) (Oakley, et al. (1990), supra; Ellenberger, et al. (1992) Cell 71, 

1223-1237; K6nig, et al. (1993) J. Mol. Biol. 233, 139-154). The protein-DNA 

electrostatic interactions which are targeted for disruption by the Arg-Pro-Arg- 

5 polyamides are highlighted in Figure 2B. 

Homeodomain proteins recognize the minor groove of DNA via a highly 

conserved Arginine (Arg) - Xaa - Arg (Xaa = any amino acid) (Gehring, W. J., et al., & 

Wttthrich, K. (1994) Homeodomain-DNA recognition. Cell 78, 211-223; Gehring, et al. 

(1994) Homeodomain proteins. Annu. Rev. Biochem. 63, 487-526). In Hin recombinase, 

10 the corresponding Arg 140 - Proline 141 (Pro) - Arg 142 domain serves as a bridge 

between the N-terminal arm in the minor groove and the helix-turn-helix motif which 

recognizes the major groove (Sluka, et al. (1990) Biochemistry 29, 6551-6561; Feng, et 

□ al. (1994) Science 263, 348-355). Minor groove contacts made by the side chain of 

^ Arg 140 direct the peptide chain up from the floor of the minor groove, toward the 

^ 149 

H 15 backbone, where the guanidinium of Arg makes electrostatic contact with phosphates. 
\i Upon interaction with DNA, the Arg-Pro-Arg domain achieves a stable, local tertiary 

3 structure which is potentially based solely on the primary sequence. It was postulated that 

Arg-Pro-Arg attached at the C-terminus of a polyamide would adopt a similar structure to 
2 that of Arg 140 -Pro 141 -Arg 142 in Hin recombinase. The resulting Arg-Pro-Arg- 

|s* 20 polyamide could be used to place a neutralizing positive charge at a predetermined 

phosphate on the DNA backbone (Figure 3). Arg-Pro-Arg-polyamide synthesis is 
■0 exemplified here for polyamide 3. 

ImPyPyPy^-PyPyPyPy-$-RPR (compound 3) 
Polyamides 1-4 are illustrated in Figure 4A-D. ImPyPyPy-y-PyPyPyPy-p-RPR- 
25 MBHA-resin was synthesized in a stepwise fashion by machine-assisted solid phase 
methods [44] from MB HA resin (600 mg, 0.57 mmol/g, calculated as L n ew(mmol/g) = 
L 0 ld/(1 + Lold(W n ew - W 0 ld) * 10" 3 ), where L is the loading (mmol of amine per gram 
of resin) and W is the weight (g mof 1 ) of the growing peptide attached to the resin 
(Barlos, et al. (1991) Int. J. Peptide Protein Res. 37, 513-520). A sample of polyamide 




resin (300 mg, 0.30 mmoi/g) was placed in a Kel-F reaction vessel, />-cresol (1 g) added, 
and the vessel cooled to -60 °C. HF was then condensed into the vessel. The solution 
was stirred vigorously for one hour (0 °C) and the excess HF was removed in vacuo. The 
reaction mixture was then treated with cold ethyl ether (50 mL) and the resulting 
5 resin/polyamide coprecipitate was collected by vacuum filtration. The polyamide was 
then extracted with CH3CN:H20:TFA (10:9:1), 0.1% (w/v) TFA added (6 mL) and the 
-resulting solution purified by reverse phase HPLC using a, Waters DeltaPak. 25 x 1.00.mm 
100 urn Ci8 column in 0.1% (w/v) TFA, gradient elution 0.25%/min. CH3CN. 
ImPyPyPy-y-PyPyPyPy-P-RPR-NH2 was recovered upon lyophilization of the 
10 appropriate fractions as a white powder (84 mg, 60% recovery); UV (H2O) X ma x 244, 
O 312 (66,000); *H NMR (DMSO-</6): § 10.49 (s, 1H), 9.96 (s, 1H), 9.93 (s, 1H), 9.90 (s, 

Q 3H), 9.84 (s, 1H), 8.19 (d, 1H, J=7.6 Hz), 8.07 (m, 1H), 8.02 (m, 1H), 7.95 (m, 1H), 7.91 

J= (m, 1H), 7.53 (m, 3H), 7.38 (s, 1H), 7.26 (d, 1H, J=1.6 Hz), 7.25 (s, 1H), 7.21 (d, 1H, 

« J=1.6 Hz), 7.20 (s, 1H), 7.19 (s, 1H), 7.14 (m, 3H), 7.03 (m, 4H), 6.87 (d, 1H, J=1.6 Hz), 

f 15 6.85 (m, 2H), 4.48 (t, 1H, J=4.8 Hz), 4.27 (q, 1H, J=4.4 Hz), 4.77 (m, 1H), 3.96 (s, 3H), 
g 3.81 (m, 12H), 3.80 (s, 3H), 3.77 (s, 3H), 3.76 (s, 3H), 3.60 (m, 2H), 3.32 (q, 2H, J=4.8 

\± Hz), 3.17 (q, 2H, J=6.1 Hz), 3.0 (m, 4H), 2.36 (t, 2H, J=6.9 Hz), 2.25 (t, 2H, J-6.9 Hz), 

£ 2.0 (m, 2H), 1 .77 (m, 4H), 1 .64 (m, 2H), 1 .46 (m, 6H). 

~ The polyamides ImPyPyPy-y-PyPyPyPy-p-Dp (1) and ImlmPyPy-y-ImPyPyPy-p- 

20 Dp (2) were synthesized in a stepwise manner from Boc-p-alanine-Pam resin using Boc- 
chemistry machine-assisted protocols as previously described (Baird, et al. (1996) 7. Am. 
Chem. Soc. 118, 6141-6146). Polyamides with C-terminal aliphatic amino acids were 
synthesized on MBHA resin from Im and Py monomer units and commercially available 
aliphatic amino acids in 26 steps (Figures 4 and 5). Treatment with HF:/>-cresol (9:1) 
25 followed by precipitation with ethyl ether and extraction with 0.1% TFA.CH3CN (50:50) 
afforded the deprotected polyamide which was purified by reverse phase HPLC. 




Example 3 

Selective inhibition of GCN4 (222-281) binding by Arg-Pro-Arg polyamides 

Synthetic DNA fragments were prepared on an ABI 380B Automated DNA 
Synthesizer and purified by preparative denaturing polyacrylamide gel electrophoresis. 
5 ARE-1 (5 ' -CCGGATCC ATGGTTGCTGACTAATTGTTATCCTCTAGAGTCGACC- 

3') and ARE-2 (5'-CCGGATCCATGGTTGCTGACTAATTGGTCTCCTCTAGAGTCG 
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-ACC-3 5 ) -were radiolabeled -at the 5Merminus with -y- P-ATP^and. polynucleotide 
kinase, annealed to an equimolar amount of the unlabeled complement, and purified by 
nondenaturing polyacrylamide gel electrophoresis (Sambrook, et al. Molecular Cloning. 
1 0 (2nd ed.). Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY). 
^ For gel mobility shifts, polyamide was incubated with radiolabeled synthetic 

"is? 

W DNA duplex (10 kcpm) in 40 \iL reaction volumes of bisTris (10 mM, pH 7.0), NaCl 

£ (100 mM), DTT (1 mM), EDTA (1 mM), and poly(dI-dC)»poly(dI-dC) (50 ng/mL) for 

□ 16 hours at 22 °C (20 mM MOPS, pH 7.0, 140 mM KC1, 10 mM NaCl, ImM MgCl2, 1 

T 15 mM spermine was used to model ionic conditions in vivo). GCN4 (222-281) was added 

y and equilibrated for 30 minutes. Loading buffer (15% Ficoll, 0.025% bromophenol blue) 

i~ (10 nL) was added and 10 nL was immediately loaded . onto a running 8% (29:1, 

: V 

; fl acrylamide:bis-acrylamide) polyacrylamide gel (0.5 X TBE, 280 V, 0.8 mm, 13 cm). 

■O ■ 

Sufficient separation of the free DNA and the DNA*GCN4 (222-281) complexes was 
20 achieved within 45 minutes. Gels were dried and exposed to a storage phosphor screen 

(Molecular Dynamics) (Johnston, et al. (1990) Electrophoresis 1 1, 355). 

Synthetic radiolabeled DNA duplexes, ARE-1 and ARE-2, containing a GCN4 

binding site (5 ' -CTGACT AAT-3 ') (Oakley, et al. (1990) Science 248, 847-85029; 

Oakley, et al. (1992) Biochemistry 31, 10969-10975), were bound near saturation at 200 
25 nM GCN4 (222-281) as revealed by gel mobility shift analysis (10 mM bisTris pH 7.0, 

100 mM NaCl, 1 mM DTT, 1 mM EDTA, 50 ng/mL poly(dI-dC)»poly(dI-dC), 22 °C). 

ImPyPyPy-y-PyPyPyPy-P-polyamides (1 and 3) target the six base pair 5'-TGTTAT-3' 
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site adjacent to the GCN4 binding site of ARE-1 (Figure 6). ImlmPyPy-y-ImPyPyPy-p- 
polyamides (2 and 4) were designed to bind 5'-TGGTCT-3' adjacent to the GCN4 site in 
ARE-2 (Figure 6). 

The ability of polyamides to inhibit GCN4 (222-281) binding was evaluated using 
5 the gel mobility shift assay. Increasing concentrations of polyamide were incubated with 
the desired radiolabeled synthetic DNA duplex followed by the addition of a constant 
concentration of 200 -nM-<jGN4-<«222-28t). DNA fragments .bound and .unbound by 
GCN4 were separated using nondenaturing polyacrylamide gel electrophoresis. 

Polyamides 1 and 2, which lack the Arg-Pro-Arg moiety were unable to inhibit 
10 GCN4 binding (Figure 6). The upper band in Figure 6 A is the ARE-1 fragment bound by 
; 3 GCN4 (222-281) (lanes 2-12). Lanes 3-11 show that GCN4 binding was unaffected by 

W the addition of ImPyPyPy-y-PyPyPyPy-p-Dp. However, ImPyPyPy-y-PyPyPyPy-p-RPR, 

:: p which differs from 1 by the addition of the C-terminal Arg-Pro-Arg, inhibited GCN4 

q binding to ARE-1 (Figure 6C). When bound to its match site on ARE-2, ImlmPyPy-y- 

T 1 5 ImPyPyPy-P-RPR also successfully inhibited GCN4 binding (Figure 6F). 
;jj Based on the pairing rules for polyamide-DNA complexes, the sites 5'-TGTTAT- 

t 3' (ARE-1 fragment) and 5'-TGGTCT-3' (ARE-2 fragment) are for ImPyPyPy-y- 

PyPyPyPy-RPR "match" and "double-base-pair mismatch" sites, respectively, and for 
ImlmPyPy-y-ImPyPyPy-RPR "double-base pair mismatch" and "match" sites, 
20 respectively. Incubation of GCN4 and up to 2 |iM ImPyPyPy-y-PyPyPyPy-P-RPR with 
the double mismatch ARE-2 fragment resulted in no inhibition of GCN4 binding (Figure 
6D). Likewise, ImlmPyPy-y-ImPyPyPy-P-RPR did not inhibit GCN4 binding to the 
•mismatched ARE-1 fragment (Figure 6E). 

When bound to their respective match sites, ImPyPyPy-y-PyPyPyPy-p-RPRRRR 
25 (5) and ImlmPyPy-y-ImPyPyPy-P-RPRRRR (6), which contain an additional three C- 
terminal arginine residues relative to 3 and 4, were found to fully inhibit GCN4 (222-281) 
binding (Figures 7 and 8). The gel mobility shift experiments depicted in Figure 6 




demonstrate that 5 and 6 selectively provided complete inhibition of GCN4 binding with 
no apparent loss in specificity for double-base-pair mismatches. 

Example 4 

Design of Optimum Tripeptide for GCN4 Inhibition 
Polyamides with deletions and/or substitutions in the Arg-Pro-Arg domain were 
prepared 4n order Xo -detennine-the,^lenients.which .were .essential , for GCN4 inhibition 
(Figure 9). Each of these polyamides was based on the ImPyPyPy-y-PyPyPyPy-p 
polyamide targeted to 5'-TGTTAT-3' of ARE-1. The ability of polyamides 7-14 
(Figures 9 and 10) to bind their DNA target sites and inhibit GCN4 binding to ARE-1 
was evaluated using DNase I footprinting and gel mobility shift analysis. 

DNase I footprinting of polyamides 1-14 was performed on restriction fragments 
containing the appropriate ARE-1 or ARE-2 sequences under conditions identical to 
those used for the gel mobility shift experiments. In every case, (except 11, see below) 
the polyamide was found to specifically bind the target site with ~ 1 x 10 7 M~ l 
(Figure 1 1). Lower K% values are observed for polyamides under the gel shift conditions 
due to the carrier DNA which artificially depresses polyamide binding constants. The 
polyamide concentrations required for GCN4 inhibition are within the expected range 
based on the under gel shift conditions. 

Deletion of the terminal Pro-Arg or Arg, as in ImPyPyPy-y-PyPyPyPy-P-R, (7) 
and ImPyPyPy-y-PyPyPyPy-P-RP (8), results in polyamides which are unable to inhibit 
GCN4 binding. Substituting the proline with glycine afforded ImPyPyPy-y-PyPyPyPy-p- 
RGR (9), which did not effectively inhibit GCN4 (222-281). At 1 nM of 9, < 50% of the 
GCN4 was inhibited. No inhibition of GCN4 binding was observed for ImPyPyPy-y- 
PyPyPyPy-p-R^PR (10) which contained a single inversion of stereochemistry relative 
to 3. 




The internal arginine was replaced with an alanine residue to provide ImPyPyPy- 
y-PyPyPyPy-p-APR (11). 11 was unable to inhibit GCN4 binding under these 
conditions. This Arg to Ala substitution was the only alteration which was found to 
affect polyamide binding affinity. By DNase I footprinting, 11 binds the S'-TGTTAT-S' 
5 target site with ten-fold lower affinity than 3 under conditions identical to those used for 
gel shift analysis. The conservative substitution of lysine for arginine in ImPyPyPy-y- 
PyPyPyPy-p-KPR (12) also compromised the polyamide's ability to inhibit GCN4. At 1 
HM 12, < 50% of the bound GCN4 was inhibited, similar to 9. However, the identical 
substitution in the C-terminal position afforded ImPyPyPy-y-PyPyPyPy-P-RPK (13), 
10 which inhibited GCN4 binding identically to the Arg-Pro-Arg polyamide 3. The amino 
Q acid linkage between the final Py amino acid and the initial arginine was also crucial for 

i7j GCN4 inhibition. A polyamide in which the p-alanine linker was replaced with a 7- 

J aminoheptanoic acid linker, ImPyPyPy-y-PyPyPyPy-C7-RPR (14), was unable to inhibit 

2 GCN4 binding. Protein inhibition did not require prebinding of polyamide. 

=P 15 Preincubation of ARE- 1 with GCN4 followed by addition of 5 afforded inhibition 
□ identical to that of prebound polyamide. 

U By targeting an 8-ring Arg-Pro-Arg-polyamide adjacent to a GCN4 binding site, 

!^ selective inhibition of DNA binding by a protein which exclusively contacts the major 

^ groove is achieved (Figures 4 and 6). The polyamide domain binds sequence specifically 

20 in the minor groove with double base pair mismatches preventing GCN4 inhibition. 

The inability of truncated analogs 7 (R) or 8 (RP) to inhibit GCN4 binding 
indicates that the C-terminal arginine in 3 (RPR) is crucial for GCN4 inhibition. Based on 
the Hin recombinase model, this arginine is expected to make non-specific contacts to the 
DNA phosphate backbone. The ability of 13 (RPK) to inhibit GCN4 identically to 3 
25 (RPR) supports this model. The neutralization of a portion of the backbone is the most 
likely mechanism by which Arg-Pro-Arg polyamides achieve GCN4 inhibition. Other 
models, such as steric blockage of the major groove or DNA distortion, cannot be ruled 




out (Strauss, et al. (1994) Science 266, 1829-1834). Modeling suggests that Arg-Pro-Arg 
is insufficient to cross the DNA backbone and block the major groove. Determination of 
the exact mechanism of inhibition awaits high-resolution structure studies which are in 
progress. 

5 The results of GCN4 inhibition experiments with polyamides 7-14 suggest that 

140 

the Arg-Pro-Arg of 3 and 4 adopts a stable and well-defined structure similar to Arg 
Pro 141 rArg 142 .of-Hin ,recGmbinase. Theinternal Arg-Pro of 3 and 4 (RPR) is required 
for GCN4 inhibition. Polyamide 8 does not inhibit GCN4, suggesting that these two 
residues play a structural role in the placement of the terminal arginine near the phosphate 
10 backbone. Replacing the rigid proline of 3 (RPR) with a flexible glycine (9) (RGR) 

□ allows significant amounts of GCN4 to remain bound in the presence of saturating 
ill concentrations of 9. The glycine in 9 may permit the C-terminal arginine to shift to a 

position which permits simultaneous binding with GCN4. 
^ ImPyPyPy-y-PyPyPyPy-p-R D PR (10) is a diastereomer of ImPyPyPy-y- 

^ 15 PyPyPyPy-P-RPR (3), but is unable to inhibit GCN4. Modeling indicates that 

□ substitution of D-proline for L-proline may in fact direct the neutralizing terminal arginine 
jH to the backbone of the opposite DNA strand. However, confirmation of this prediction 
if* awaits studies with other protein systems. 

. 

y Replacement of the internal arginine with alanine, as in 11, reduces binding 

20 affinity by a factor often and prevents GCN4 inhibition. Furthermore, the Lys-Pro-Arg 
polyamide, 12, exhibits a binding affinity comparable to 3, yet it is a less effective 
■ inhibitor of GCN4. Together, these results suggest that the guanidinium of the internal 
arginine makes specific contacts with the DNA which are required for the proper 
positioning of the remaining residues. Replacement of the p-alanine linker (3) with 7- 
25 aminoheptanoic acid (14) eliminates inhibition, further implicating the placement of the 
Arg-Pro-Arg moiety as a requirement for effective inhibition. 
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Example 5 

Salt Dependence of Positive Patch Polyamides 
In order to evaluate the sensitivity of positive patch mediated major groove 
protein inhibition to the nature of the compensating electrolyte, as well as the overall 
ionic strength, gel mobility shift analysis was performed using a buffer which models the 
environmentof^^^ 

1 mM MgCl2, 1 mM spermine) (Jones, et al. (1993) J. Org. Chem. 58, 2983-2991). Arg- 
Pro-Arg-polyamide 3 was found to inhibit GCN4 (222-281) binding under the in vivo 
ionic conditions which feature KC1 as the primary compensating electrolyte and the 
conditions optimized for protein binding which feature NaCl as the predominant 
compensating electrolyte (Figure 6C). Further biophysical characterization of major 
groove protein inhibition by positive patch polyamides will be reported in due course. 

Example 6 

Arg-Pro-Arg Polyamide Binding Affinity and Specificity 

In order to evaluate the effects of the Arg-Pro-Arg moiety on the DNA binding 

properties of the polyamides, quantitative DNase I footprint titration experiments were 

performed to determine the equilibrium association constants of polyamides 1-6 for their 

respective six base pair match and single-base-pair mismatch sites (10 mM Tris pH 7.0, 

10 mM KC1, 10 mM MgCl2, 5 mM CaCl2, 22 °C) (Brenowitz, et al. (1986) Methods 

Enzymol 130, 132-181). 

The Afl IVFsp I restriction fragment of pJT8 (Trauger, et al. (1996) Nature 382, 

559-561) was 3'- 32 P-end-labeled by digesting the plasmid with Afill and Fspl and 
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simultaneously filling in using Sequenase, [a- PJ-deoxyadenosine-S'-triphosphate, and 
[a- 32 P]-thymidine-5'-triphosphate, and isolating the 229 bp fragment by nondenaturing 
gel electrophoresis. The 250 bp £coRI/PvwII restriction fragment of pJK6 (Struhl, K. 
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(1992) Yeast GCN4 transcriptional activator protein. In Transcriptional Regulation. 
(McKnight, S. L. & Yamamoto, K. R., eds), pp. 833-859, Cold Spring Harbor Laboratory 
Press, New York) was prepared in an analogous manner. A and G sequencing were 
carried out as described (Maxam, et al. (1980) Methods Enzymoi 65, 499-560; Iverson, et 
5 al. (1987). Methods Enzymoi 15, 7823-7830). 

All reactions were executed in a total volume of 400 |iL. A polyamide stock 
-solution -or H2O -(for -reference -lanes) was . added -to ^m^assay .buffer .containing 
radiolabeled restriction fragment (20 kcpm), affording final solution conditions of 10 mM 
Tris»HCl (pH 7.0), 10 mM KC1, 10 mM MgCl2, 5 mM CaCl2, and either (i) 0.001 nM- 
10 100 nM polyamide or (ii) no polyamide (for reference lanes). The solutions were allowed 
j 3 to equilibrate at 22 °C for 18h. Footprinting reactions were initiated by the addition of 4 

ijj |iL of a DNase I stock solution (at the appropriate concentration to give ~ 55% intact 

: {! DNA) containing 1 mM dithiothreitol and allowed to proceed for seven min at 22 °C. 

A The reactions were stopped by the addition of 50 ^iL of a solution containing 1.25 M 

1 5 NaCl, 1 00 mM EDTA, 0,2 mg/mL glycogen, and 28 |aM base-pair calf thymus DNA, and 
3 ethanol precipitated. Reactions were resuspended in IX TBE/80% formamide loading 

\^ buffer, denatured by heating at 85 °C for 10 min, and placed on ice. The reaction 

;fl products were separated by electrophoresis on an 8% polyacrylamide gel (5% cross-link, 

7 M urea) in IX TBE at 2000 V for 1.5h. Gels were dried and exposed to a storage 
20 phosphor screen (Molecular Dynamics) (Johnston, et al. (1990) Electrophoresis 1 1, 355). 

Data from the footprint titration gels were obtained using a Molecular Dynamics 
400S Phosphorlmager followed by quantitation using ImageQuant software (Molecular 
Dynamics). Background-corrected volume integration of rectangles encompassing the 
footprint sites and a reference site at which DNase I reactivity was invariant across the 
25 titration generated values for the site intensities (I s j te ) and the reference intensity (I re f). 
The apparent fractional occupancy (0 a pp) of the sites were calculated using the 
equation: 
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0app = T . Isitf/U (i) 

Isite / Iref 

where I° s ite ^ *°ref ^ ^ e s * te an( * re f erence intensities, respectively, from a control 
lane to which no polyamide was added. The ([L] tot , 0 a pp) data points were fit to a 
Langmuir binding isotherm (eq 2, n=l) by minimizing the difference between 0 a pp and 
5 Gfit, using the modified Hill equation: 

0m = £Ln + (Qna* - 0min) ~ J } n (2) 

1 + l\a [LJ tot 

where [L] tot is the total polyamide concentration, K% is the equilibrium association 
constant, and 0 m in and 8 ma x are the experimentally determined site saturation values 
when the site is unoccupied or saturated, respectively. The data were fit using a nonlinear 
: ^ 1 0 least-squares fitting procedure with 9 ma x, and 9 m i n as the adjustable parameters. All 
l O acceptable fits had a correlation coefficient of R > 0.97. At least three sets of data were 

H used in determining each association constant. All lanes from each gel were used unless 

\i visual inspection revealed a data point to be obviously flawed relative to neighboring 

points. 

;U 15 DNase I footprinting of ImlmPyPy-y-ImPyPyPy-p-Dp (2), ImlmPyPy-y- 

® ImPyPyPy-p-RPR (4), and ImlmPyPy-y-ImPyPyPy-P-RPRRRR (6) was performed on 

!U the 250 bp £coRI/PvuII restriction fragment of pJK6 (Kelly, et al. (1993) Proc. Natl 

5 Acad. Sci., USA 93, 6981-6985). 2 bound the match sites, 5'-TGGTCA-3' and 5'- 

TGGACA-3', with identical affinities within experimental error (A^ a = 1.3 (±0.1) x 10 10 

20 M" 1 and 6.4 (±1.2) x 10 9 M" 1 , respectively). 2 also demonstrated greater than 100-fold 

7 -1 

specificity for a single base pair mismatch site 5'-TGTACA-3' (A^a < 5 x 10 M , 

mismatched base pair underlined). Similar affinity and a slight increase in specificity 

were observed for Arg-Pro-Arg polyamide 3. 5'-TGGTCA-3' and S'-TGGACA-S 1 were 

boundby 3 (K z = 4.6 (±0.2) x 10 10 M" 1 and 6.6 (±1.0) x 10 10 M" 1 , respectively) with 

7 -1 

25 greater than 450-fold specificity versus the mismatch site (K& = < 1 x 10 M ). The 
additional three terminal arginines of 6 generated a ten-fold increase in affinity relative to 




2 coupled with a significant loss in specificity for a single-base-pair mismatch. 6 bound 
5'-TGGTCA-3\ 5'-TGGACA-3\ and 5'-TGIACA-3' with affinities of 2.6 (±0.4) x 
10 10 M" 1 , 2.8 (±0.5) x 10 10 M" 1 and 1.9 (±0.8) x 10 10 M" 1 , respectively. 

Corresponding results were observed for DNase I footprinting of ImPyPyPy-y- 
5 PyPyPyPy-P-Dp (1), ImPyPyPy-y-PyPyPyPy-P-RPR (3) and ImPyPyPy-Y-PyPyPyPy-P- 
RPRRRR (5) on the 229 bp A/IlVFspl restriction fragment of pJT8 (Trauger, et al. (1996) 
Nature 382, 559-561). 4- has-been. -shown. .to .bind „its.~six.,base„pair match site, 5'- 
AGTTAT-3', with an affinity of 3.5 (±0.8) x 10 9 M" 1 and 7-fold specificity versus a 
single base pair mismatch site 5'-AGTACT-3' (K & = 5.0 (±0.5) x 10 8 M" 1 , Table 3) 
1 0 (Trauger, et al. (1996) Nature 382, 559-561). 



Table 3 

Equilibrium Association Constants (M 1 ) 





Polyamide 


5'-AGTATT-3' 


5'-AGTACT-3' 




ImPyPyPy-Y-PyPyPyPy-P-Dp 


3.5 x 10'* 


5.0 xlO 8 * 




ImPyPyPy-y-PyPyPyPy-P-RPR 


5.5 x 10 8 


9.2 x 10 7 




ImPyPyPy-Y-PyPyPyPy-P-RPRRRR 


1.0 x 10'° 


3.4 x 10' 



ffl "Values reported for the six-base pair match (5'-AGTATT-3') and mismatch (5'- 

\f AGTACT-3') (mismatch underlined) sites are the mean values obtained from three Dnase 

Hj 15 I footprint titration experiments on the Aflll / Fsp I restriction fragment of pJT8. The 



assays were carried out at 22°C at pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM 
KC1, 10 mM MgCl 2 , and 5 mM CaCl 2 . *From Trauger, et al. (1996) Nature 382, 559- 
561. 

The Arg-Pro-Arg polyamide 3 demonstrated only a slight loss in affinity and a 

20 similar specificity (K a = 5.5 (±1.5) x 10 8 M" 1 for 5'-AGTATT-3' and 9.2 (±0.4) x 10 7 

M" 1 for 5'-AGTACT-3'). Analogous to 6, the additional terminal arginines of 5 

provided a ten-fold increase in affinity for the match site (K& = 1.0 (±0.2) x 10 10 M ') 

9 -1 

and a severe loss in specificity for a single-base-pair mismatch (K^AitO.S)* 1 0 M ). 
Quantitative DNase I footprinting demonstrates that the addition of a C-terminal 
25 Arg-Pro-Arg tripeptide as in 3 and 4 does not alter the DNA binding properties of eight- 
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ring hairpin polyamides (1 and 2). However, Arg-Pro-Arg-Arg-Arg-Arg-polyamides, 5 
and 6, have increased binding affinity but no specificity for a single base pair mismatch 
site. DNase I footprinting and gel mobility shift analysis demonstrate that 5 and 6 retain 
their specificity versus double base pair mismatch sites. These results indicate that 

5 synthetic ligands may balance the benefits of additional charge with the consequence of 
lowered sequence specificity (Breslauer, et al. (1988) The origins of the DNA binding 
-affinity and specificity-of minor -groove -directed 4igands: -correlations of .thermodynamic 
and structural data. In Structure and Expression (Vol. 2), DNA and Its Drug Complexes 
(Sarma, R. H. & Sarma, M. H. eds), pp. 273-289, Academic Press). For example, a 

10 distamycin analog modified with a decaaza decabutylamine moiety on a pyrrole nitrogen 
interferes with binding of a major groove transcription factor (Bruice, et al. (1997) 
Bioorg. Med. Chem. 5, 685-692). Unfortunately, the sequence specificity of this 
molecule which contains potentially eleven positive charges has not been reported. The 
results described here suggest that such a molecule may bind DNA with reduced 

15 sequence specificity. 

EXAMPLE 7 
Inhibition of DNA-Binding Proteins by Polyamides 

Oligonucleotides were synthesized and purified as previously described (Liberies, 
20 et al. (1996) Proc. Natl. Acad. Sci., USA 93, 9510-4). Polyamides were also 
synthesized as previously described and by Baird, et al. (1996, J. Am. Chem. Soc. 118, 
6141-6). Briefly, synthesis was performed in a stepwise manner from Boc-(i-Alanine- 
Pam resin. Polyamides were then cleaved by reaction with ((dimethylamino)propyl)- 
amine and purified by HPLC chromatography. 
25 To prepare labeled DNA, plasmid DNA was digested with Hindlll and EcoRX for 

gel shift analysis, or Pvull and EcoRl for footprinting analysis, and simultaneously 
labeled with Sequenase 2.0, deoxyadenosine 5 , -(a- 32 P)-triphosphate, thymidine 5'-(a- 32 P)- 




triphosphate, and nonradioactive deoxynucleoside triphosphates. The fragment 
waspurified by gel electrophoresis, treated with proteinase K, filtered, further extracted 
with phenol/chloroform, and precipitated with ethanol. 

For DNase I and MPE footprint reactions, all reactions were equlibrated at 22°C, 
5 pH 5.5, in the presence of 45 mM MES, 1 mM MgCl 2 , and labeled DNA for at least 24 
hours. Footprinting reactions were carried out as previously described (Hertzberg, et al. 
41384)>Biochemis^ Natl. Acad : Sci., 

USA 83, 8462-8466.). 

To perform gel shift assays and titrations, all samples were equilibrated as above 
10 for at least 24 hours. One-tenth volume of 15% glycerol loading buffer was added and 
Q samples were run on a 10% polyacrylamide gel at 40°C, pH 5.5, with a 75:1 

jTj acrylamide/bis-acrylarnide ratio in 45 mM Mes and 1 mM MgCl 2 with buffer 

J recirculation. Quantitation of isotherms was performed by plotting the ligand 

concentration against the portion of labeled DNA in the bent conformation and curve fit 
:! P 15 using a Langmuir binding isotherm, as previously described (Brenowitz, et al. (1986) 
□ Proc. Natl. Acad. Sci., USA 83, 8462-8466; Bailly, et al. (1995) J. Molecular Biology 

U 253, 1-7). 

]q . Two 15 bp purine tracts separated by one tum of the DNA helix (10 bp) were 

"** targeted by oligonucleotides containing two pyrimidine tracts (T and *C) connected by a 

20 central T linker of size 2-9. It has been previously shown that such oligonucleotides bend 
DNA to a varying degree dependent upon the size of the linker (Liberies, D. A. & 
Dervan, P. B. (I 996) Proc. Natl. Acad. Sci., USA 93, 9510-4). The intervening 10 bp 
not targeted by the third strand oligonucleotide can be bound by a polyamide specifically 
designed for that sequence, as depicted in Figures 13 and 14. The binding affinity of the 
25 polyamide for its target sequence has previously been determined under similar 
conditions and reported to be 3.7 x 10'° M* 1 for PA1 and 5.0 x 10 8 M" 1 for PA2, both 
much stronger than the affinity of either the unlinked 1 5mers for their sites, or the linked 
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bidentate oligonucleotides (Liberies, D. A. & Dervan, P. B. (1996) Proc. Natl. Acad. 
Sci., USA 93, 9510-4); Liberies, D. A. & Dervan, P. B. , unpublished data; Trauger, J. 
W., Baird, E. E. & Dervan, P. B. (1996) Nature 382, 559-61). Given the ability of 
polyamides to straighten DNA, it is demonstrated herein that targeting a polyamide to the 
5 intervening duplex of bent DNA can straighten it, displacing a ligand bound several base 
pairs distally, as depicted in Figure 15. 

-PA1 at a-concentration -of 400 pM ^an^isplace-^^DNA..bending*.thkd»strand 
oligonucleotide at a concentration of 1 ^M. This effect is clearly seen in Figure 16 for 
oligonucleotides with linkers of 2 and 3 T residues, bending DNA greater than 600. In 

10 these lanes, the more retarded bent structure is shifted to a less retarded structure with 
mobility similar to a double Y structure, where two oligonucleotides are bound by a 
single target DNA molecule with the polyamide presumably bound between them. For 
oligonucleotides with smaller bend angles, the polyamide does not displace the third 
strand oligonucleotide and the DNA distortion is probably not sufficient to preclude 

15 efficient polyamide binding. The specificity of this effect is seen in Figures 17A and 
17B, where only PA1, but not PA2 can displace oligonucleotide 2 at a concentration of 
100 pM, while neither can displace oligonucleotide 9. 

These experiments were performed with simultaneous addition of polyamide and 
oligonucleotide. Next, we sought to determine if the order of addition was important 

20 given the reported half life of a bound third strand oligonucleotide on DNA at 
approximately 12 hours under near physiological conditions (Maher, et al. (1990) 
Biochemistry 29, 8820-8826). As shown in Figure 18, simultaneous addition, 
preincubation with polyamide for one hour, or preincubation with third strand 
oligonucleotide for 1 hour made little difference in the ability of the polyamide to inhibit 

25 binding of the third strand oligonucleotide, where 1 hour is less than expected association 
rate for both oligonucleotide and polyamide (Maher, L. J., Dervan, P. B. & Wold, B. J. (I 




990) Biochemistry 29, 8820-8826; Albert, et al. (1997), submitted). As such, polyamides 
may be useful in targeting prebound transcription factors in cells. 

It has been shown that the energy required to bend DNA with bidentate triple 
helical ligands is less than predicted by theoretical models of DNA as a smoothly bending 
5 wormlike chain with coulombic repulsion from phosphates placed at fixed distances 
(Akiyama, T. & Hogan, M. E. (1996) J. Biological Chemistry 271, 29126-29135; 
-- Liberies, D A^Dervan, P. B . unpublished data;. .Bloom-field,- £t -aL ( 1 974.) in,Ehysical 
Chemistry of Nucleic Acids (Harper and Row, New York), pp. 159-166; Fenley, et al. 
(1992) J Physical Chemistry 96, 3963-3969). However, the prebending of target DNA 
10 for TBP, a general transcription factor, significantly altered its binding affinity (Parvin, et 
Q al. (1995) Nature 373, 724-727). The effects of prebending on the affinity of the 

iTj polyamide by measuring the Ka, against triplex mediated bending was determined, 

g Figure 17 shows a sample gel shift titration, where the measured Ka, for PA1 is 6.2 (±0.3) 

; 5 x 10 Al °, whereas K, for PA2 is <1 x 10 9 M" 1 . Ka, for PA1 is within experimental error of 

P 15 Ka measured by DNase 1 footprinting and shows that the energy for straightening DNA 
□ bent by a ligand with a lower binding affinity is minimal (Trauger, et al. (1996) Nature 

U 382, 559-61). Bending effects with TBP may be much larger, given the large number of 

\q protein-DNA contacts supporting the bent structure (Kim, et al. (1993) Nature 365, 512- 

"' U 520; Bond, et al. (1994) Biophysical Journal 67, 825-836). Additionally, TBP-bound 

20 DNA is bent towards the major groove, not the minor groove. 

To confirm that binding of the polyamide is indeed in the intervening duplex, 
DNase I and MPE footprinting were performed, as shown in Figure 17. While no binding 
in the intervening duplex and no displacement of the third strand oligonucleotide are seen 
for PA2, PA1 dislodges the third strand oligonucleotide, while protecting the intervening 
25 duplex. This supports the model of displacement of the bending ligand. 

The mechanism of action of polyamides is assumed to be directly through 
regidification of the double helix. Alternative modes of action to be considered are 




simple steric blockage, or disruption of the solvation shell or counterion shell. Modeling 
of the triple helix-mediated bend shows a linker that is displaced from the intervening 
duplex to accomodate the bend angle, where shorter linkers are displaced further from the 
duplex than longer linkers. The inability of PA1 to displace bending oligonucleotides 
5 with longer linkers and bend angles less than 600 degrees argues against this explanation. 
Disruption of the salvation shell in the minor groove is likely to be steric and therefore 
« unlikely -to -extend into -the triple -helical region where .no .polyamide is .bound. 
Furthermore, this minor groove effect is unlikely to effect binding of a third strand in the 
major groove, where simultaneous binding has previously been demonstrated (Parks, et 
10 al. (1996) Bioorganic & Medicinal Chemistry 4, 1045-50). Similarly, the high charge 
□ density of double helical and triple helical DNA results in a large counterion shell around 

jTj the molecule and in a very short Debye screening length where charge effects are unlikely 

J to extend for multiple base pairs (Bond, et al. (1994) Biophysical Journal 67, 825-836; 

^ Philpott, et al. (1995) J. Electrochem. Soc. 142, L25-L28.). 

: f 15 By modifying DNA structure, generalizable sequence-specific polyamides have 

Q been designed to displace a DNA bending ligand at an adjacent but nonoverlapping 

binding site. This ability to displace DNA bending ligands through rigidification may be 

ill ... 
i A useful in the design of polyamides as artificial regulators of gene expression, providing a 

"zsr 

~ potentially valuable tool in molecular biology and human medicine. 

20 

As disclosed herein, the present invention provides the reagents and 
methodologies for the preparation and use of a variety of new polyamides comprising 
positive patch sequences for specific recognition of DNA in the minor groove and 
inhibiting the function of DNA-binding proteins that bind the major groove. Also 
25 provided is a methodology for determining the mechanism of action of inhibition of 
DNA-binding molecules. While a preferred form of the invention has been shown in the 
drawings and described, since variations in the preferred form will be apparent to those 




skilled in the art, the invention should not be construed as limited to the specific form 
shown and described, but instead is as set forth in the claims. 
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